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3 <110> APPLICANT: Benning, Christoph 

4 Sanda, Sherrie 

5 Yu, Bin 

7 <120> TITLE OF INVENTION: Compositions and Methods for the Synthesis and Subsequent 
Modification of 

8 Uridine - 5 - Diphosphosul f oquinovose (UDP - SQ) 







<130> FILE REFERENCE: MSU-04769 




















I* > 


1 o 


<140> CURRENT APPLICATION NUMBER: US/09/709 , 020A 










c — > 


lz 


<141> CURRENT FILING DATE: 


2000 


-11- 


08 


















1 9 


<160> NUMBER OF 


SEQ 


ID NOS: 


24 
























<170> SOFTWARE: 


Patentln version 3 . 


0 


















J- D 


<210> SEQ ID NO 


: 1 




























T *7 
J. / 


<211> LENGTH: 1134 




























1 ft 


<212> TYPE: DNA 
































<213> ORGANISM: 


Cy anoba c t e r i um 


Synechococcus 














*C J. 


<22 0> FEATURE: 






























22 


<221> NAME / KEY : 


CDS 




























23 


<222> LOCATION: 


(1) 


. . (1134) 
























25 


<400> SEQUENCE: 


1 




























26 


atg cgc ate get 


etc 


ttt 


acc 


gag 


acg 


ttc 


etc 


ccc 


aaa 


gtg 


gat 


ggc 


48 




27 


Met Arg lie Ala 


Leu 


Phe 


Thr 


Glu 


Thr 


Phe 


Leu 


Pro 


Lys 


Val 


Asp 


Gly 






28 


1 


5 










10 










15 








30 


ate gtc acg egg 


ctt 


egg 


cac 


acg 


gtc 


gat 


cac 


ctg 


cag 


cgt 


Ctt 


ggc 


96 




31 


lie Val Thr Arg 


Leu 


Arg 


His 


Thr 


Val 


Asp 


His 


Leu 


Gin 


Arg 


Leu 


Gly 






32 


20 










25 










30 










34 


cac ace gtc atg 


gtt 


ttt 


tgc 


ccc 


gac 


ggc 


999 


etc 


cgc 


gag 


cac 


aag 


144 




35 


His .Thr Val Met 


Val 


Phe 


Cys 


Pro 


Asp 


Gly 


Gly 


Leu 


Arg 


Glu 


His 


Lys 






36 


35 








40 










45 












38 


999 9Ct cga gtc 


tat 


999 


gtt 


aaa 


ggc 


ttt 


ccg 


eta 


ccg 


etc 


tat 


ccc 


192 




39 


Gly Ala Arg Val 


Tyr 


Gly 


Val 


Lys 


Gly 


Phe 


Pro 


Leu 


Pro 


Leu 


Tyr 


Pro 






40 


50 






55 










60 














42 


gag ctg aag eta 


get 


ttt 


ccg 


ttg 


ccg 


aaa 


gtg 


gga 


aaa 


gee 


ttg 


gag 


240 




43 


Glu Leu Lys Leu 


Ala 


Phe 


Pro 


Leu 


Pro 


Lys 


Val 


Gly 


Lys 


Ala 


Leu 


Glu 






44 


65 




70 










75 










8 0 






46 


egg ttc egg ccc 


gac 


ctg 


ate 


cac 


gtg 


gtc 


aat 


ccg 


get 


gtg 


ttg 


ggg 


288 




47 


Arg Phe Arg Pro 


Asp 


Leu 


He 


His 


Val 


Val 


Asn 


Pro 


Ala 


Val 


Leu 


Gly 






48 




85 










90 










95 








50 


ttg 99 c 99C ate 


tac 


tat 


gee 


aag 


gcg 


eta 


aat 


gtg 


cca 


etc 


gtg 


gcg 


336 




51 


Leu Gly Gly lie 


Tyr 


Tyr 


Ala 


Lys 


Ala 


Leu 


Asn 


Val 


Pro 


Leu 


Val 


Ala 






52 


100 










105 










110 










54 


tec tat cac acc 


cat 


ttg 


ccg 


aaa 


tac 


ctt 


gag 


cat 


tac 


ggg 


ctg 


ggg 


.3 84 




55 


Ser Tyr His Thr 


His 


Leu 


Pro 


Lys 


Tyr 


Leu 


Glu 


His 


Tyr 


Gly 


Leu 


Gly 






56 


115 








120 










125 












58 


gtc ttg gag ggg 


gtg 


etc 


tgg 


gaa 


ttg 


ctg 


aag 


ctg 


gcg 


cat 


aac 


caa 


432 
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59 Val Leu Glu Gly Val Leu Trp Glu Leu Leu Lys Leu Ala His Asn Gin 

60 130 135 140 

62 gca gcg ate aac etc tgt act tea ace gcg atg gtg cag gag ctg aca 480 

63 Ala Ala lie Asn Leu Cys Thr Ser Thr Ala Met Val Gin Glu Leu Thr 

64 145 150 155 160 

66 gat cac ggc att gag cac tgt tgc etc tgg cag cga gga gtg gat ace 52 8 

67 Asp His Gly lie Glu His Cys Cys Leu Trp Gin Arg Gly Val Asp Thr 

68 165 170 175 

70 gag acc ttt egg cca gac ttg get act get gcg atg cgc gat cgc etc 576 

71 Glu Thr Phe Arg Pro Asp Leu Ala Thr Ala Ala Met Arg Asp Arg Leu 

72 180 185 190 

74 agt ggc ggt aag ccc act gcg ccc ttg ttg etc tac gtc gga cgc etc 624 

75 Ser Gly Gly Lys Pro Thr Ala Pro Leu Leu Leu Tyr Val Gly Arg Leu 

76 195 200 205 

78 tea gee gag aag caa ate gat cgc ctg cga ccc att ttg gat gee aat 672 

79 Ser Ala Glu Lys Gin lie Asp Arg Leu Arg Pro lie Leu Asp Ala Asn 

80 210 215 220 

82 cct gag get tgc ttg gee ttg gtc ggc gat ggc ccg cat egg gee gaa 72 0 

83 Pro Glu Ala Cys Leu Ala Leu Val Gly Asp Gly Pro. His Arg Ala Glu 

84 225 230 235 240 

86 eta gag caa ttg ttt get ggc acc cag acg cag ttc att ggc tat ctg 768 

87 Leu Glu Gin Leu Phe Ala Gly Thr Gin Thr Gin Phe lie Gly Tyr Leu 

88 245 250 255 

90 cat ggg gaa cag eta ggg gcg gec tac get tct get gac get ttt gtc 816 

91 His Gly Glu Gin Leu Gly Ala Ala Tyr Ala Ser Ala Asp Ala Phe Val 

92 260 265 270 

94 ttt ccc tec egg acc gaa acc etc ggt eta gtc ttg ctg gaa gee atg 864 

95 Phe Pro Ser Arg Thr Glu Thr Leu Gly Leu Val Leu Leu Glu Ala Met 

96 275 280 285 

98 gca gcg ggt tgt ccg gtc gtg gcg gec aat tec ggt ggc att ccc gat 912 

99 Ala Ala Gly Cys Pro Val Val Ala Ala Asn Ser Gly Gly lie Pro Asp 

100 290 295 300 

102 att gtc age gac ggc att aat ggt ttc ctg ttc gat cct gag gat gaa 960 

103 lie Val Ser Asp Gly lie Asn Gly Phe Leu Phe Asp Pro Glu Asp Glu 

104 305 310 315 320 

106 caa ggg gcg ate get gcg att cag cgc ttg ttg get aac cct gca gag 1008 

107 Gin Gly Ala lie Ala Ala lie Gin Arg Leu Leu Ala Asn Pro Ala Glu 

108 325 330 335 

110 cgc gag att eta cgc caa gcg get cgt caa gaa gee gaa cgc tgg age 1056 

111 Arg Glu lie Leu Arg Gin Ala Ala Arg Gin Glu Ala Glu Arg Trp Ser 

112 340 345 350 

114 tgg aac gca gee acg cgc caa etc cag gac tac tac tgc gag gtg ttg 1104 

115 Trp Asn Ala Ala Thr Arg Gin Leu Gin Asp Tyr Tyr Cys Glu Val Leu 

116 355 360 365 

118 gca gat ggt tgc tta ccc tta gcg gec tga 1134 

119 Ala Asp Gly Cys Leu Pro Leu Ala Ala 

120 . 370 375 

123 <210> SEQ ID NO: 2 

124 <211> LENGTH: 377 
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125 <212> TYPE: PRT 

126 <213> ORGANISM: Cyanobacterium Synechococcus 
128 <400> SEQUENCE: 2 

130 Met Arg lie Ala Leu Phe Thr Glu Thr Phe Leu Pro Lys Val Asp Gly 

131 15 10 15 

134 lie Val Thr Arg Leu Arg His Thr Val Asp His Leu Gin Arg Leu Gly 

135 20 25 30 

138 His Thr Val Met Val Phe Cys Pro Asp Gly Gly Leu Arg Glu His Lys 

139 35 40 45 

142 Gly Ala Arg Val Tyr Gly Val Lys Gly Phe Pro Leu Pro Leu Tyr Pro 

143 50 55 60 

146 Glu Leu Lys Leu Ala Phe Pro Leu Pro Lys Val Gly Lys Ala Leu Glu 

147 65 70 75 80 

150 Arg Phe Arg Pro Asp Leu lie His Val Val Asn Pro Ala Val Leu Gly 

151 85 90 95 

154 Leu Gly Gly He Tyr Tyr Ala Lys Ala Leu Asn Val Pro Leu Val Ala 

155 100 105 110 

158 Ser Tyr His Thr His Leu Pro Lys Tyr Leu Glu His Tyr Gly Leu Gly 

159 115 120 125 

162 Val Leu Glu Gly Val Leu Trp Glu Leu Leu Lys Leu Ala His Asn Gin 

163 130 135 140 

166 Ala Ala He Asn Leu Cys Thr Ser Thr Ala Met Val Gin Glu Leu Thr 

167 145 150 155 160 

170 Asp His Gly He Glu His Cys Cys Leu Trp Gin Arg Gly Val Asp Thr 

171 165 170 175 

174 Glu Thr Phe Arg Pro Asp Leu Ala Thr Ala Ala Met Arg Asp Arg Leu 

175 180 185 190 

178 Ser Gly Gly Lys Pro Thr Ala Pro Leu Leu Leu Tyr Val Gly Arg Leu 

179 195 200 205 

182 Ser Ala Glu Lys Gin He Asp Arg Leu Arg Pro He Leu Asp Ala Asn 

183 210 215 220 



186 


Pro 


Glu Ala 


Cys 


Leu 


Ala 


Leu 


Val 


Gly 


Asp 


Gly 


Pro 


His 


Arg 


Ala 


Glu 


187 


225 








230 










235 










240 


190 


Leu 


Glu Gin 


Leu 


Phe 


Ala 


Gly 


Thr 


Gin 


Thr 


Gin 


Phe 


He 


Gly 


Tyr 


Leu 


191 








245 










250 










255 




194 


His 


Gly Glu 


Gin 


Leu 


Gly 


Ala 


Ala 


Tyr 


Ala 


Ser 


Ala 


Asp 


Ala 


Phe 


Val 


195 






260 










265 










270 






198 


Phe 


Pro Ser 


Arg 


Thr 


Glu 


Thr 


Leu 


Gly 


Leu 


Val 


Leu 


Leu 


Glu 


Ala 


Met 


199 




275 










280 










285 








202 


Ala 


Ala Gly 


Cys 


Pro 


Val 


Val 


Ala 


Ala 


Asn 


Ser Gly Gly 


He 


Pro 


Asp 


203 




290 








295 










300 










206 


He 


Val Ser 


Asp 


Gly 


He 


Asn 


Gly 


Phe 


Leu 


Phe 


Asp 


Pro 


Glu 


Asp 


Glu 


207 


305 








310 










315 










320 


210 


Gin 


Gly Ala 


He 


Ala 


Ala 


He 


Gin 


Arg 


Leu 


Leu 


Ala 


Asn 


Pro 


Ala 


Glu 



211 325 330 335 

214 Arg Glu He Leu Arg Gin Ala Ala Arg Gin Glu Ala Glu Arg Trp Ser 

215 340 345 350 

218 Trp Asn Ala Ala Thr Arg Gin Leu Gin Asp Tyr Tyr Cys Glu Val Leu 

219 355 360 365 
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222 Ala Asp Gly Cys Leu Pro Leu Ala Ala 

223 370 375 

226 <210> SEQ ID NO: 3 

227 <211> LENGTH: 2 979 

228 <212> TYPE: DNA 

229 <213> ORGANISM: Cyanobacterium Synechococcus 
.231 <400> SEQUENCE: 3 

232 ctacacgtta ccttccggta ctggaaacag tcgtttaatc aaccaattga ttggtcccaa 60 

234 aacatgaact ttctttttcc tccagaacca aatcgctgca ctgtactgtt cattgcgtat 120 

236 ctttgtcgtt gctgctctcc aatcatattt ctcggtctct tctcttgccg cttttccaat 180 

238 gatctctctt gtttcacggt cgtgcagtaa agttctcagt tttgtcacgc aatcttcaac 240 

240 atctccaggg ttgaacaaaa atccggtttt tccctcctga tgaaaacatc agaatcagaa 300 

242 aaccacaagc tcaatatagg ttgacccata agaacaatca atgcaagatc attttgtgta 360 

244 ccagtctatg attgaataaa gtttcagttc ggttacagct cgcttataag aaaattggca 42 0 

246 gaaattgttt tttcaaccat ttcggttcgg ttgatatgct catcaatatg gtttggcagt 480 

248 taattgtaat tcagataatt cactgacctg atcttcaggg attatatcag ggattccacc 540 

250 ggcacgggcc gcgacgacag gaagtcctga agacattgct tcaagaacca caaggccaag 600 

252 tgtctccgac tctgatggca tcacaaacac atctccactt gcgtaagctt gtgagagttc 660 

254 atcgccttgt aacgttccag tgaaaaccgc tggcattccg gtaaacaact tctcaagatc 72 0 

256 ctctctttaa gaaaacgaaa cagataaaca aaattacaat gttgttgact agaaatcttc 780 

258 agataacaat atggccaatc tttaacaaaa ctagtacttg tatggtccat ctccaatgaa 840 

260 agcaatccga gcttcaggta atttgtccat tacactgcac acaaatttct caatatcaaa 900 

262 attcgataca ccacttaaaa gaagtgagtc cagtttatac aaaattctaa cctctttaaa 960 

264 agctccaaac tcttttctac gccaatgcga cctacatgaa tcactagtgg cttttctggt 1020 

266 tcgccattac tgttaattac aaaatattaa acatcaagat tagcgtggaa agtatcattg 1080 

268 tttttaatgc atataaaaga aacgtatatt ctattcttgc ctcagtctta tacgcatttc 1140 

270 ttgagaacgg aaacggggat tgaagctttc tgaatcgaca cccttattcc aaagtcgaag 1200 

272 ttgattagct tcaatatatg aaaaaaagaa gaagaaaatg taagttttga acaatcatag 1260 

274 agcttgtaca aaaatgtaat gtatgatctt tcttcacctg cagttgcacc agctgctata 1320 

276 agatcttttc caatggcagc agaaggaact aatgtaagat cagccgctct gtgaaggaac 1380 

278 cctgataaaa gcatattcaa gtttagtttc atattataca tcacaataaa ccagaaaaag 1440 

280 aagaaggaaa ttttgacatt tgaaaagcgg gttttacata cttattatag accacattgg 1500 

282 ttttaccaac caactaaaag tgtatcttgg gatgtatctg cacatcaaca caagcttaat 1560 

284 cttagacaaa aattttttta taacaacatt gtgaaatgag gcagaaaaag gtacttacac 1620 

286 agggacgtgt gtgtggtaag acattactat tggtacagat agcatttttg ctattgccag 1680 

288 agcaccaaag acctaaaaat tttagtcagg gaaaaaagag agtcaagttc tggattctct 1740 

290 ccagttcact ggtcttgcta atgttttagt aatgtgaatt cttgagggat ttaccataac 1800 

292 tccgggagat gaagcgtgta taatgtcagg cttaaaccgt gcaatttcag agatgattct 1860 

294 tggactaagc gcaagcgaga gtggaacctt ttggtaataa ggacaaggga agctacagaa 1920 

296 gagaagaaga aattagcgat attaccaaat agagaacatc cagtgagtaa actaaatggt 1980 

298 gctaccttct tgatccaatg actctggctc cataaaactc ttcaggaaca ccttcatgtg 2040 

300 tcgtcacgac tataacctat gaagcaaaaa agttattaaa aaaaaaaaaa aaggaacagt 2100 

302 taacacttgt caagtaattc taattctgga aacagttact tatgagctga ctgaaaagat 2160 

304 acttaagttg aagaatgaga tagtaaaaga agaaacctcg tctcccattt cacggaggta 2220 

306 tctaatgaaa ttctggaatc tgtttttgta gccggataca tagctgcaac aaaatcaaag 2280 

308 agagaatcac ttccaataat aacatgacat ataataaaag cttttggtca atggatcggt 2340 

310 gattccgaga atcttgggat attcacaact aaaatctgac aactttgact caaacaaatc 2400 

312 ctgaatgtaa ttggttttaa cgatctacta tataatttgc taaattggtg gtgtagcaaa 2460 

314 ttcatacatt agcgagtatc tcttcataaa ataaatgtaa cgatcaaatc gaaagaaaaa 2520 
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316 aacattacag gaaaaagtca accaaggaaa aaaatgagta gaatctgttt tcacagagac 2580 

318 atttcgtcga acacaaaaca agcaaaaaaa gaacactgtg aagaagactt acgcaaaggg 264 0 

320 agaaggctca acaaagagag caattctcct aggcttagag agcgactcag gatcgagaag 2700 

322 cggcgcatca atctccgatt cgtcatcttc tctgacttga gtaatagtca tatcgttgga 2760 

324 cccagaaaca gcttcctttg taatcactcc acagaagcgg agcttgcttt tcttgctaat 2 820 

326 gggtaatctc cggtgaccaa acgaaagagg ggaatgaaga acaaaagaag aagacctggg 2 880 

328 aggagaacaa gaggttgcag aggaagaaga acaagtgtta gtcgtgctag gaagcaaatg 2940 

330 aggaggtata gagagattta tagaagaaag agtcgtcat 2979 

333 <210> SEQ ID NO: 4 

334 <211> LENGTH: 2537 

335 <212> TYPE: DNA 

336 <213> ORGANISM: Arabidopsis thaliana 

338 <400> SEQUENCE: 4 

339 tcatattttg aaaaagcact tcttcttctt cttcttcttc ttcttcttct tcttcttccc 60 
341 atcaaaaagt gctagtggca agaccgcgca tattccagta acaaagttta aattcaaaac 12 0 
343 tatggaacac aatttgtccc atgttttgtt gcctgaatta aacgaccata tcacttggtt 180 
345 cggccacttc atctcggata cgcacaagcg gttgtcaagg ttgtaaaaga cgattttgta 240 
347 aggatctaca ttggcaaagg gagctttaca gacgacttga aaagtttcgg tgtgaagatt 300 
349 gaaaggtaaa actttggttt ctttgcactc gqtgaaccaa taaagcgacc catctagata 36 0 
351 cacaggtgcg gggtaagcag caatccgata aggagcagcg agagtgacat acctccaagc 42 0 
353 attggtacta aagtcgaaaa cttcgcatgt agtagcgttt tttaggccta tttctgcaga 480 
355 gttgtataac caaacgggct tgtatgtgcc cctgaatttg tctttaccga atccaagcat 540 
357 aaagaatttg cgtttaagct tgtagtaacc atctcgtaag tcgatcatga gtttttgata 600 
359 atcgcaaaga ggaagaggcc gataccatct agtggtcggg ttgaccacat aaccggattt 660 
361 gtcgtgattg taaagacaaa cgagaccgtc acaactacta tgtgaaacta agtacagtac 72 0 
363 gttatccttc tcccaaggag tagggatctt gaacgttgat gatgaaccca actgcaatgt 780 
365 tcttaaagat tctatggtcg ggtttataac atggtgagga aatacagaca ccatcagaac 840 
367 atctggatct cctgattgct gacgatgttt caactaaaat tgtatacatt tttattttat 900 
369 taaaaagata aatatcattt tgttggaatt ggaaaacaaa atctaaaaga taactttaga 960 
371 tatgtaaggg tcgcatatgt atttgattac attgctatga gtatatgact cacttttcca 1020 
3 73 ataatgaaaa aaataaagat gttgagtctt actaattaag ggtaaataca aaatttctga 1080 
375 tcattaatac aaagaaaaag cctctcataa gccaaagcca ttttactcgc cgtaaacatc 1140 
377 tccgcagcat actctttaca cttccttcct ctctccgcca atctctccgc tccttcagcc 1200 
379 acagcaacct ccataaccgc cgtcaaagct tccacatttg gcgcaaacat aaacccaaac 1260 
381 tcatcattca ccactatagt cctctttatg cttgcgtacc tagatgccat cactggttta 1320 
383 ccactcaaca tcgcttccat taacgttaaa tcaagacctt gtggtctaag cgttggattc 1380 
385 acgaacaaat cgatcccgtt gtagaaaccc ttgagctcat ttgggtttag agatcccaaa 1440 
387 atggaaactt tttcccctaa ttccttgtaa cgttgctccc atggccctga tccagctact 1500 
389 acgaggtaaa catttgaata cgtttggatt attttcgcga aagcttcgaa gagcaatgga 1560 
391 tgtcctttgt ctttgactaa tctcccagca gctcctaaaa caatcgctga tgagttttct 1620 
393 ggtaacccta attttgacct aaacagagta cgtagcttct tgtctgatgt gaatccgttc 1680 
3 95 tcgtcgactc cattgaggat cacatgaacc cttttctcag ggatttggta aacgtctctt 1740 
397 agcatttctc cgcagctatc gctgatagcg atgtggtgag cgtagttgtg gaagaatctg 1800 
399 atttcgtcca gtatcttggg aagcacggca ccgtatagac ttgcattgaa gccttgtgat 1860 
401 cttggttcgt ctggtttacg gatcaggtct tggtaaatac ttgattgtaa gctctctaac 1920 
403 gcaatgccgt gccaggatac agcgaggttt ggaacctccc gggcgatcca gtgaggtaaa 1980 
405 gcaacacttt cagagtgaac cgcatcgaaa ggttctttct tgttttcttc ttggtaaagc 2040 
407 tcccatgcct tgttgtaccg ccattttccg ggctccgcgt cgccgtgaga atggattata 2100 
409 gggtaaatga tttggtcgga aacagggggg attttgttgg tttcagggga ttggtctaag 2160 
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